Universal Dependencies for Afrikaans
Abstract
The Universal Dependencies (UD) project aims to develop a consistent annotation framework for treebanks across many languages. In this paper, we present the UD scheme for Afrikaans and describe the conversion of the AfriBooms treebank to this new format. We also compare this conversion with the conversion of related syntactic structures in typologically similar languages.
Similar resources
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
The origin of Afrikaans pronunciation: a comparison to west Germanic languages and Dutch dialects
This paper aims to find the origin of the Afrikaans pronunciation with the use of dialectometry. First, Afrikaans was compared to Standard Dutch, Standard Frisian and Standard German. Pronunciation distances were measured by means of Levenshtein distances. Afrikaans was found to be closest to Standard Dutch. Second, the Afrikaans pronunciation was compared to 361 Dutch dialect varieties in the ...
Using the Corpus of Spoken Afrikaans to generate an Afrikaans chatbot
This paper presents two chatbot systems, ALICE and Elizabeth, illustrating the dialogue knowledge representation and pattern matching techniques of each. We discuss the problems which arise when using the Corpus of Spoken Afrikaans (Korpus Gesproke Afrikaans) to retrain the ALICE chatbot system with human dialogue examples. A Java program to convert from dialog transcripts to the AIML linguisti...
Universal Decompositional Semantics on Universal Dependencies
We present a framework for augmenting data sets from the Universal Dependencies project with Universal Decompositional Semantics. Where the Universal Dependencies project aims to provide a syntactic annotation standard that can be used consistently across many languages as well as a collection of corpora that use that standard, our extension has similar aims for semantic annotation. We describe...
Automatic alignment of audiobooks in Afrikaans
This paper reports on the automatic alignment of audiobooks in Afrikaans. An existing Afrikaans pronunciation dictionary and corpus of Afrikaans speech data are used to generate baseline acoustic models. The baseline system achieves an average duration independent overlap rate of 0.977 on the first three chapters of an audio version of “Ruiter in die Nag”, an Afrikaans book by Mikro. The averag...